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I Ave. 



ST Microelectronics Pvt. Ltd., an Indian company, of Plot No. 2 & 3, 
Sector 16A 9 Institutional Area, Noicla - 201 3001, Uttar Pradesh, India. 

hereby declare - 

(a) that I am/we re in possession of an invention titled "Improved 
Mapping Of Programmable Logic Devices. " 

(b) that the provisional / complete specification relating to this invention is 
filed with this application 



further declare that the inventor(s) for the said inventions is/are 

(i) S HARM A Sunil Kumar, an Indian citizen, of A3 12, Sector 19 
Noida, U. P. 201 301, India. 

I/we claim the priority from the application(s) filed in convection countries, 
particulars of which are as follows: NA 

IAve state that the said invention is an improvement in or modification of the 
invention the particulars of which are as follows and of which IAve are the 
applicant/patentee: NIL 

IAve state that the application is divided out of my/our application, the particulars 
of which are given below and pray that this application be deemed to have been 
filed on under section 16 of the Act. NIL 

That I am/we are the assignee or legal representative of the true and first 
inventors. 

That my/our address for service in India is as follows: 

A NA NO & A NA Nil A d vacates 
11-4 7, Nizantuddin East 
New Delhi - 110 013 



(c) 



that there is no lawful ground of objection to the grant of a patent to 
me/us. 



Tel Nos.: (11) 4355078, 4355076, 4350360 
Fax Nos.: (11) 4354243, 4352060 
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I/We the true and first inventors of this invention or the applicant(s) in the 
convention country declare that the applicant* s) herein is/are my/our assignee 
or legal representative. 



a) 



Sunil Kumar Sharma an Indian National of A3 12 Sector 19 
NOIDA , UP-201 301 



Signature 




Dated this 



day of 2002 



1 0- that to the best of my/our knowledge, information and belief the fact and matters stated 
herein are correct and that there is no lawful ground of objection to grant of patent to 
me/us on this application. 

1 1- Following are the attachment with the application 

(a) Complete specification (3 copies) 

(b) Abstract 

(c) Formal drawings 

(d) Power of Attorney 

(e) Form 1 (in triplicate) 

(f) Form 3 ( in duplicate) 

(g) Fee Rs. 5000/- In cash/cheque/bank draft bearing no. , date 



Bank. 



I/We request that a patent may be granted to me/us for the said invention. 




The Controller of Patents 
The Patent Office, Delhi 
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Form 2 

THE PATENTS ACT, 1970 



COMPLETE SPECIFICATION 



[See Section 10J 



'IMPROVED MAPPING OF PROGRAMMABLE LOGIC DEVICES' 

I 

o 

A 



STMicroelectronics PvL Ltd., Plot No. 2 & 3, Sector 16A, Institutional Area, Noida - 201 
301, Uttar Pradesh, India, an Indian Company 



The following specification particularly describes and ascertains the nature of this invention 
and the manner in which it is to be performed. 
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IMPROVED MAPPING OF PROGRAMMABLE LOGIC DEVICES 
Field of the Invention 

cascade logic eleml whi^g * «" — * 

Background of the Invention 

^Wbb Lo^c Devices (PLDs) provide rae rf 

mapoine of Z , 7 Ca, " >a ^ C ° nf, « Urin « Solves the 

configured for desired ftinctionalitv The effiri™, «■ 

,«■. „ ... - cnonanty. me efficiency of mapping algorithms is critical to the 

effecttve utilization of PLDs and FPGAs In LIJT ror- a 

on the LUTs. LUT-based FPGAs the mapping is implemented 

The conventional LUT-based FPGA mappmg algorithms can be divided into two classes The 
agonthnts in the firs, Cass emphasize the minimizanon of the number T^TJZ 

decomposinon and bin pacing technics. The mgonthms in me second Cass emphal^ 
m_n of the delay of the solntion. This class indudes "F,^ by CongTdY 
Ding Which use fl„ w base, technics in mapping with node dup, Jfion ,o 
logical depth of the mapped netlist [2]. 

A OAG-map (Direct A„a Iy s,s Graph) method for FPGA technology mapping for delay 
optimization has been proposed by J Conn etal in n. u , 

, H *^ ° y J Lon 8 etaL HI- This method utilizes a graph based 

technology mapping algorithm "DAG M,>„» e, j i 

FPGA desinn, -n. . . ' y ° p,ln " za,ion » lookup-table based 

eZ 7 ^ ° U ' ,eChn<>l08y maPPing a " d **" on *. 

nr r a n r As a preprocess,n8 s,ep ,n " dag - m * p " * 

transforms an arbitrary ».,»p ut network int0 . corresponding 
increase ,„ the „ et wor k dep,h ; Fi„a„y, a graph matchmg based techmoue which ^ 
area optimizat.on without increasing the networa de,ay is used as a pos processing^* 
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"DAG-Map" This method does not however utilize the cascade elements available with each 
LUT in the FPGA. 

Another optimal technology mapping algorithm for delay optimisation in Lookup -Table 
based FPGA Designs has been proposed by Jason Cong et. al. \2\. This method proposes a 
polynomial time technology-mapping algorithm, called "Flow-Map", that optimally solves 
the LUT-based FPGA technology-mapping problem for depth minim.zation for general 
Boolean networks. A key step in "Flow-Map" is the computation of a minimum height K- 
feasible cut in a network, by network flow computation. This algorithm does effectively 
minimize the number of LUTs by maximizing the volume of each cut and by several post 
processing operations but it does not utilize the cascade elements with the LUT to further 
reduce the size of the logic. 

A method for On Area/Depth Trade-off in LUT-Based FPGA Technology mapping has been 
disclosed in reference [3] by Jason Cong and Yuzheng Ding. In this method the area and 
depth trade off in LUT based FPGA technology mapping is proposed by performing a 
number of depth relaxation operations to obtain a new network with bounded increase in 
depth and advantageous for subsequent re-mapping for area minimization. The resulting 
network is then re-mapped to obtain an area-minimized mapping solution. By gradually 
increasing the depth bound for each design a set of mapping solutions with smooth area and 
depth trade-off is achieved. For the area minimization step, an optimal algorithm for 
computing an area-minimum mapping solution without node duplication is developed. 
However this method also does not talk about the area minimization by utilizing the cascade 
elements with each LUT. 

Another method proposed by Jason Cong and Yuzheng Ding [4\ proposes an integrated 
approach to synthesis and mapping that extends the combinatorial limit set up by the depth- 
optimal "Flow Map" algorithm. The new algorithm, "FlowSYN", uses global combinatorial 
optimisation techniques to guide the Boolean synthesis process during depth minimization. 
The combinatorial optimisation is achieved by computing a series of minimum cuts of fixed 
heights in a network based on fast network flow computation, and the Boolean optimisation 
is achieved by efficient OBDD-based implementation of functional decomposition. This 
method also does not utilize cascade elements. 
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The Object and Summary of the Invention 

The object of this invention is to provide an a 
circuits by utilizing cascade elements in LUT-based FPGAs 



. 2^^°"*°* 1S t0 — » - efncentlv syn thes, 2 in g electron, 



Yet another object of the invention is to nmviH. . -- .^.j . .. . . 

minimum depth in LUT-based FPGAs. ' S ^ an * a <*™* ™th 

Device fPI ni Xk. ... j P <LUT) based Programmable Logic 

Dev.ce (PLD). The method operates by selecting an unmapped or partially mapped LOT Z 
■denting a group of circuit Cements for mapping on the JTed I""^ 
avarlable capacity of me .elected LUT and the mapping constmim, The^n^ l! 
*-» am then mapping onto the selected LUT. The identification of cn^ZZd 

selected LUT. The process ,s continued until all the circuit Cements have been mapped The 

Sits: Turi taco ™ oniy ^ — - ^ — ■ ~ 

LUTs have beTu^i Z tT ^ ^ ^ """^ ■» 

— .i^i: mappra8 ~ ,nc,ude — — • 



Brief Description of the Drawings 

shows a 
(FPGA) 

Figure 2 shows the FPGA circuit implementation 



Figure 1 shows schematic diagram of a conventional Field Programmable Gate Array 



process, according to this invention. 
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Figure 3 shows the flow diagram of the technology-mapping step in the FPGA circuit 
implementation process, according to this invention. 

Figure 4 shows the process flow diagram for LUT mapping in the technology-mapping 
step in the FPGA circuit implementation process, according to this invention. 

Figure 5 shows the schematic diagram of a cascade element coupled with the LUT. 

Figure 6 shows the schematic diagram of a typical four input logic cone. 

Figure 7 shows a conventional net list mapped logic circuit without using the cascade 
feature of a programmable logic device. 

Figure 8 shows a netlist mapping using the cascade feature of the programmable logic 
device in accordance with the present invention. 

Detailed Description 

The following description is based on the definition of technical terms given below: 
Electronic design the logical structure of an electronic device such as an 

integrated circuit This may be specified either as a 
behavioural description, as high-level Boolean equations, a 
circuit schematic or in any other form representing the 
logical arrangements of a device. It may include different 
constraints such as timing constraints, placement constraints, 
or mapping constraints etc. 



Target hardware device the hardware device on which an electronic design is 

implemented. In context of this invention, a target hardware 
device typically includes the symmetric array of 
uncommitted logic elements. The uncommitted elements 
consist of the LUT, cascade gate, MUXes and flip-flops. 
These elements are grouped to form another hierarchy called 
programmable logic blocks, which are again grouped to form 
a programmable logic device. 
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Mapping 



° mP ' ^ S0ftware and hardware on which the software operates for 

compiling the electronics design. The function of the 
compiler is to synthesize the netlist and map the netlist 
(design) to the target device. 

refers to the process of grouping gates from a gate netlist or 
other hardware independent representation of logic into a 
logic block. In other words the logic design may be divided 
imo clusters representing the various logic functions within 
the design. These clusters are mapped onto the uncommitted 
logic elements in the programmable logic device during the 
compilation of the electronic design. The conditions for 
grouping (mapping) the gates into logic cells is that it should 
be possible to implement the grouped gates in the logic cell, 
For example, if a four input LUT is taken as a logic cell then 
the grouped gates must have less than or equal to four inputs 
for successful mapping. 

The present invention utiles the cascade dements durmg the HDL compiiauon and ntaps 
the design for programmable integrated circuits daring technology mapping. 

to «he aecompanytng diagrams (figure 6, figure 7, figure 8) a hexagon represents a LUT a 
ctrde represents a logic node, an anew represents a connection, and a rectengle represents' a 
cascade element. 

Figure 1 is a schematic b.ock diagram of a typical LUT-based Field Programmable Gate 
Array. A typical FPGA has vertical/horizontal routing tines 1, an array of logic blocks 2 and 
tmerfacing I/O pads 3. The routing resources ■ connect the logic block elements 2 and I/O 
pads. The FPGA can also have switch boxes a, the intersection of routing lines for connecting 
to the logic block arrays. 

Figure 2 shows the flow chat. 100 of an electronic design compilation process. The process 
of compdation is started , ,0 by either choking ,cons or by passtng the command to start the 
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compilation. Once the compilation is started the design entries 120 are entered by the user 
according to which the circuit is synthesized 130. The LUTs and the PLBs are then mapped 
140 over the synthesized circuit. After proper placement of the elements and routing lines 
150 the configuration bit is generated and the logic is then configured on the FPGA ^The 
compiler operates in accordance with user specifications. The user spectficafions can be in 
terms of timing requirements, area constraints or any other desired constraints. The compiler 
synthesizes the design to produce a net list, which describes the functionality of design for 
implementation on the programmable logic device. The net list can be the collection of gates, 
state machines & macros etc. The nodes of net list are connected via nets and each net has a 
signal associated with it. The net list is synthesized to remove the redundant logic, and to 
meet specified constraints. 

Figure 3 shows a simplified block representation 200 of e technology mapping step of the 
electronic design compilation. The technology mapping can broadly be divided in two parts, 
LUT mapping 14$ and PLB packing 142. 
tv\» 

Figure 4 shows a flow chart of the LUT mapping process 300 according to^the present 
invention. In this process the first step is to decompose the GATE netlist^l into simple 
gates. The second step establishes the feasibility of pushing additional logic without violating 
the specified constraints including the fan-out constraints 330 in an LUT 340. If it is feasible 
to incorporate more logic onto the LUT then the logic is mapped accordingly 350, and the 
control are returns to block 330. If it is not feasible to incorporate any additional logic onto 
the LUT then the feasibility of incorporating the cascade elements is established. If this is 
possible then appropriate logic elements are mapped onto the cascade elements. If the 
possibility does not exist then another LUT is selected and the control are returns to block 
330. The logic clusters are formed by grouping nodes. Whenever the cluster is k-infeasible, 
no more nodes are added to the cluster. Before starting a fresh cluster, the feasibility of 
incorporating the cascade element in the device architecture is determined. If the feasibility is 
established the next node is mapped to the cascade element. This decreases the block count 
for the logic. As the cascade elements are hard wired they do not use the routing resources 
and the resultant delay is less than that of a LUT. 

Figure 5 shows the structure of a Programmable Logic Block (PLB) 400 in a LUT-based 
FPGA. LUT 410 having inputs 401, and cascade logic gate 420, is connected to multiplexer 
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430 having flip-flop 440 at its output. One of the inputs 413 «„ the cascade .ogic is ft™ , he 
cascade ou, of fte previons stage and fte output 43, of the cascade gate is the cascade in for 
the next stage 

Figure 6 show the schematic diagram of a typical four input logic cone 600. The nodes 601 
602, 603, 604 are mputs from other logic cones or logic gates. Nodes 610, 620, 640 are logic' 
gate, and the arrow represents the connection between the logic gates. 

Figure 7 shows the schematic diagram 700 of a mapped nettist without using the cascade 
feature of the programmable logic device. Nodes 701, 702, 703, 704 etc. are the inputs from 
the other logic cones or logic gates, nodes 710, 720, etc. are the logic gates, and the arrow 
represents the connection between the logic gates. 

Figure 8 shows the flow chart for the mapping of a netlist 800 using the cascade logic of the 
programmable logic device according to the current invention. The mapping is implemented 
according to predefined criteria. The criteria may be logic density, speed etc. In each case 
forming of cones xs started from nodes e.g. 810, 820, 860 etc. whose fanins are already 
mapped or from the nodes which are primary mputs e.g. 801, 802, 811 etc. The maximum 
possible number of nodes is mapped onto each available LUT to increase the density of logic 
Each node required to be mapped to an LUT is selected according to the speed criteria If it is 
not possmle to map any more nodes to an LUT, the feasibility of mapping the node to the 
cascade element 890 of the LUT is established. Ifthe feasibility exists then the density of the 
log* implemented is mcreased as more logic of the same logic block is utilized At the same 
time the speed of the implemented circuit also increases as the cascade elements 890 use 
hardwired connecting lines instead of the general routing resources of the FPGA. 

Another method that can be applied for extracting the cascade chains is: 

1. Prior to forming the LUTs the nodes (logic gates) to be mapped to the cascade elements 
are identified. 

2- The entire netlist is then mapped to the LUTs without considering the cascade logic 
elements. 
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3. The identified nodes are then extracted from the mapped list as a post operation and 
mapped onto the cascade logic elements. 

Since the cascade element is a single universal gate (NAND or NOR) certain constraints have 
be observed while mapping gates onto the cascade element. These constraints can be divided 
into two groups. 

1 . Conditions that are to be verified when starting a new chain. 

2. Constraints to be verified while the cascade chain is being formed. 

The first group of conditions that need to be verified when starting a new cascade chain are: 

1. The number of common inputs to the fan in LUTs of the cascade element should be 
more than four. 

2. The gate implemented in cascade element should not be of type XOR, XNOR or NOT. 

3. Either one of the fan in gates should be in the Cascade Element or none should be in 
cascade element. 

4. Either the gate implemented in the Cascade Element or its input LUTs should be multi 
fan out, but not both. 

5. If the output of the cascade element is a primary output then the gate implemented 
inside the cascade element should not be of type 'AND' or 'NOR' gate. 

The second set of conditions that are needed to be verified while the cascade chain is being 
formed are: 

1. The gate implemented in cascade element should not be of type XOR, XNOR or 
NOT. 



9 



02-IND-l 19 

2. 



3. 



Z: n ; 0 r fan ,n gate shou,d - in °— b — - - — - * - 



4. If the caput of the cascade element is a primary outpu, then .he gate implemented 
the cascade element should not be of type 'AND' or 'NOR'. 



in 



6. 



architecture ts the independent of me algorithm used for LUT synthesis This m 

I. will be apparent to those with ordinary skm in ^ «, ^ fc 

Hustranve mtended to be exhaustive or bmiriug, havin g been presented by way o eZL 

Accordingly, «. mvention is „„, , Q ^ ^ ^ 

for purposes of disdosure, bu, rather ,„ cover all changes and modificanons *I ^To 
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We claim: 



An proved ^memod for mapping „ ^ 
selecting an unmapped or partially mapped LUT 

identifying a gronp of cireni, elements for mapping based on me ava„a bI e 
capacity of me selected LUT and the mapping consents, 

mapping the eroun of rimi* ^„ . 

~ .., „ m ^uiciiis onto the selected LUT and 

continuing the process of selecting an LUT fn™,™ ' 

^IZZZ T Io8,c ~ M,h - ~* - 

megTup. ^° ff ^ 8 * e ^ 0f ^^»d^ mapping of 



mapped to the cascade logic prior to mapping on the LUTs. 

An improved method as claimed in claim 1 wherein the cascade h~ ■ 
only a*er etther an circuit eiemems have ,m ti aUy he " £ 

An improved method as claimed in claim 1 wherein the m,™ 

incorporate one or more of the f , ^ °" °'~ de ' 0g,C 

or more of the following constraints depending imon ,h, 
connechvrty of the architecture: Penning upon the 

- XOR, XNOR and NOT Amotions are no, mapped on the cascade logic 

- omy „„ of ither the ^ ^ ^ or ^ • 

have multiple fan-outs, 

- if the outpu, of .he c _ de logjc „ , primaiy ^ ^ ^ 

onto it is not an AND' or 'NOR' gate, 

tf the mapped gale has maniple fan outs men the outputs are no, connected ,o 
more man one other ga,e mapped i„,o a cascade logic elemen,, and 
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if the mapped gate connects to the output of a multi-fan out LUT then the 
output of the LUT is not connected to more than one cascade logic element. 

6. An improved method as claimed in claim 1 including the verification of one or more 
of the following conditions at the initial mapping of the cascade logic chain 
depending upon the connections of the architecture: 

the number of common inputs to the fan-in LUTs of the cascade logic is not 

greater than the number of inputs of the LUT, 

the gate mapped onto the cascade logic is not of the type XOR, XNOR or 
NOT, and 

only one of either the gates mapped on top the cascade logic or its input LUTs 
is multi fen. 

7. An improved system for mapping an electronic digital circuit to a Look up table 
(LUT) based Programmable Logic Device (PLD) comprising: 

selecting means for selecting an unmapped or partially mapped LUT, 
grouping means for clustering circuit elements for mapping based on the 
available capacity of the selected LUT and the mapping constraints, 
mapping means for mapping the group of circuit elements onto the selected 
LUT, and 

characterized in that, the grouping means and mapping means include the mapping 
of cascade logic associated with the selected LUT. 

8. An improved method for mapping an electronic digital circuit to a Look Up table 
(LUT) based Programmable Logic Device (PLD) substantially as herein described 
with reference to and as illustrated in figures 2 to 6&8 of the accompanying drawings. 

9. An improved system for mapping an electronic digital circuit to a Look up table 
(LUT) based Programmable Logic Device (PLD) substantially as herein described 
with reference to and as illustrated in figures 2 to 6&8 of the accompanying drawings. 

Dated this^l^ay of SOrfe^bO, , 2002 ^ , T- 
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ABSTRACT 

n ■ • 2 T *&EL 

This invention provides an improved method for mapping an electronic digital circuit to _ 
Look Up table (LUT) based Programmable Logic Device (PLD). The method operates by 
selecting an unmapped or partially mapped LUT, and identifying a group of circuit elements 
for mapping on the selected LUT based on the available capacity of the selected LUT and the 
mapping constraints. The identified circuit elements are then mapping onto the selected LUT. 
The identification of circuit elements and mapping is carried out while taken into 
consideration the Cascade Logic associated with the selected LUT. The process is continued 
until all the circuit elements have been mapped. The group of circuit elements is mapped to 
the cascade logic prior to mapping on the LUTs. Conversely, the cascade logic is 
incorporated only after either all circuit elements have initially been mapped onto LUTs or 
some circuit elements remain unmapped even after all LUTs have been utilized. The mapping 
constraints include timing constraints, placement constraints, and size constraints. 
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